CDS
Accession Number | TCMCG075C29565 |
gbkey | CDS |
Protein Id | XP_017984738.1 |
Location | complement(join(4396015..4396052,4396146..4396320,4396823..4397017,4397923..4398033,4398120..4398332,4398916..4399023,4399112..4399253,4399340..4399907,4400016..4400880,4401947..4402636)) |
Gene | LOC18586451 |
GeneID | 18586451 |
Organism | Theobroma cacao |
Protein
Length | 1034aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018129249.1 |
Definition | PREDICTED: pumilio homolog 4 [Theobroma cacao] |
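The Location and Length fields can be cross-checked against each other: the joined exon intervals should add up to three nucleotides per residue plus one stop codon. The following is a minimal sketch, assuming the coordinates are inclusive and 1-based as is usual for GenBank-style feature locations. The name `cds_length` is illustrative and not part of the record.

```python
import re

# Location string copied from the CDS record above.
LOCATION = (
    "complement(join(4396015..4396052,4396146..4396320,4396823..4397017,"
    "4397923..4398033,4398120..4398332,4398916..4399023,4399112..4399253,"
    "4399340..4399907,4400016..4400880,4401947..4402636))"
)

def cds_length(location: str) -> int:
    """Sum the lengths of all start..end intervals (inclusive coordinates)."""
    spans = re.findall(r"(\d+)\.\.(\d+)", location)
    return sum(int(end) - int(start) + 1 for start, end in spans)

total_nt = cds_length(LOCATION)   # nucleotides across all ten exons
protein_aa = total_nt // 3 - 1    # codons minus the terminal stop codon
print(total_nt, protein_aa)       # 3105 1034
```

The ten intervals sum to 3105 nt, i.e. 1035 codons, which agrees with the stated protein length of 1034 aa once the stop codon is excluded.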
EGGNOG-MAPPER Annotation
COG_category | J |
Description | pumilio homolog |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction | - |
KEGG_rclass | - |
BRITE | ko00000 |
KEGG_ko | ko:K17943 |
EC | - |
KEGG_Pathway | - |
GOs | GO:0003674, GO:0003676, GO:0003723, GO:0003729, GO:0005488, GO:0097159, GO:1901363 |
Sequence
CDS: ATGGTTACAGGCAGTAACATAGATATGCTACCAACTATAGATAATGGTTTAGAAAGACATGGTGGGAATTTGGAAGATAGTTTCACTGAGCTAGAATTGATTTTGCAAGCGCATCGTAATCAACAATTTGTAGGTCGTGAAAGGGATCTTAATATATATAGGAGTGGCAGTGCTCCACCTACAGTTGAGGGATCCTTGAGTGCTGTTGGTAGTCTTTTTGCTAATCCTGATTTTGGAGACATTAATGGCATAACTGCTGTTGCTGGTAGTAGTAGTAGTAGCAATAATGGAATGCTGTCTGAAGATGAGATACGCTCACACCCTGCATATCTTTCATATTATTACTCCCATGAAAACATAAATCCAAGGCTGCCTCCACCGTTGTTATCAAAAGAGGATTGGCGTGTTGCACAAAGGTTTCAGGCTAGTGGGTCTTCCCTTGGGAACATTGGGGACTGGAGAAAGAAGAAGTTGGTTGATGGCGGTGATAGTTCGTCCTTATTTTCAATGCAGCCAGGTCTTTCTGTACAACAAGAACAAAATGATTTGATGGAACTGAGGAATACCAATGCAAGGAATACATCTAGAAAAATGTCAGCTGAGTGGCTTGATAGAGGTTCAGATGGTTTGGTTGGGCTGTCTGGTACTGGGCTTGGTGCAAGGAGGAAGAGTTTTGCTGACATTCTTCAGGATGGACTTGATCGACCTGCCACCTTATCAGGCCATCTCTCACAGCCATCAAGTCGCAATGCTTTTAGTGATATGTTGGATGCAGCTAGCATTGCTGATCCCAGTCCACCAGGTTTTCATAATGCAGCAGAGTCCATAGAGAGCTTGCCTGCTGGGGTAGCTCGTCCAGGTGTGGTAGGAGTTCAGAGCCATGGTAAAACTACTTCTCACTCTTTTGCATCTGCTGTAGGTTCATCATTATCGAGGAGTACAACTCCTGAACCATATTTAGTTGGGAGGTCTTCTGGTTCTGGACTTCCTCCTGTTGGGAGCAAGGTTGGCCATGCAGAAAAAAAGAATATCATTGGATCTAATGTCCAAAATGGGCATTCTTCTGCTGTGACTGAACTTTCTGAAATTGGAGCTACATTATCTGGGTTGACCTTATCGAAAACTAGACATGCAGATGAGAATAGTCATATGCGGTCTCAGCTTCAGGTTGATCTGGATAATCAGCTAGATTTTTCATTCAATATGCCCAATGGTCATAATCAGAGTTTGCAGCAGCAATTCATTGACAAGTCCAGTGCTGAAAAGCTTGCATTTCCTACCAACCATATCGACTTGGCAAGGAAAAAGGGAATTGCACCTAATATTAATGCTTATAATATTAGTTCCAATGGACAAGTCAGCATTCCCAAAAGAACTTCCTCTTCTGCAGATCTTTACGCAAAAGTGCATCCTTCAGGCCTTGGAAGTTTGGAAGTATGTGATGTTGGCCATCCTAATGTGAATCTTGCAAACACAGATTTCATTGGCCAACTACCCAGTGCTTATTCTGTTAACCAGAAGTTGAATTCAGCGATTAAGAACCATTTAAATGCAGGTTCCCCTTTGACTGGTACTGGGGATAGGCAAAGTTTAAATAGAGCTGGAAATCAAGGGGCTGACCTTCTTTCTCCACTTATGGATCCTCGTTATATCCAGTACTTGCAAAGAACTTCTCAGTATGGGGCACGAGCTGCAGCTAGCCCTGATTCTCTGCTTTCTGGGAACTATGTTGGTACTCTGCATGGGGATTTGGATGGCCTTCAAAAAGCATACCTTGAGGCAATATTAGCTCAACAGAAGCAGCAGTATGAACTGCCACTTTTAGGTAAAGCTGCTGCTCTGAATCATGGCTATTATGGGAATCCCTCGTATGGTCTTGGCATGCCGTTTGCTGGAAATTCAATGGCAAATTCTGTACTCCCCTCTATTGGTTCTGGAAGTATACAGAATGATAGAACTGCACGTTTTAATTCAATGATGAGAACCTCAACAGGA
GCATGGCCCTCAGATATTGGTAATAATGTGGATGGAAGATTCATATCATCTTTATTAGATGAATTTAAGAACAACAAGACTAGGTGTTTTGAACTCTTAGATATCATTGATCATGTTGTTGAATTCAGTACGGATCAGTATGGTAGTCGCTTTATTCAGCAGAAATTAGAAACTGCCACAGAGGAAGAGAAGACCAAAATATTTCCTGAGATTATTCCCCATGCTCGCGCTTTGATGACTGATGTGTTTGGAAATTATGTCATACAGAAATTTTTTGAGCATGGTACAGAAAGTCAAAGAGCAGAGTTAGCCAGTCAACTTACTGGTCATGTGTTGCCTCTCAGTCTTCAAATGTATGGTTGCAGAGTGATTCAGAAGGCTTTGGAAGTTGTTGGTGTGGATCAGCAGACTGGAATGGTGGCAGAGCTTGATGGTTCAATCATGAAATGTGTTCGTGATCAGAACGGTAATCATGTTATTCAGAAGTGTATAGAGTGTGTCCCTCAGGATCGAATTCTGTTTATCATATCTGCTTTCCATGGCCAAGTTGTCGCTCTTTCTACCCACCCTTATGGTTGTCGTGTCATTCAGAGGGTTCTGGAACATTGCGATGATGTAAAAACCCAACAAATTATTATGGATGAGATCATGCTATCTGTATGCACTCTGGCACAAGATCAATATGGGAACTATGTTATTCAGCATGTTCTTGAACATGGTAAACCACATGAGCGATCTGCTATTATCAGCAAGCTTGCAGGACAAATCGTGAAGATGAGTCAGCAGAAATTCGCTTCTAATGTTGTCGAGAAGTGCTTGACTTTTGGTGGGCCTGAGGAACGTCAAATTTTGGTGAACGAGATGCTTGGTTCTACTGATGAAAATGAGCCATTGCAGGCCATGATGAAAGATCAATTTGGAAACTATGTTGTGCAAAAGGTTCTTGAGACTTGTGATGATCGGAGTCTTGAGTTGATTCTCTCTCGAATCAAGGTACATTTAAATGCCCTGAAGAGGTACACTTACGGCAAACATATTGTTTCACGCGTTGAGAAGCTTATTGCAACTGGAGAAAGGCGCATAGGATTACTTTCGTCATTGGCCGCCTAA |
Protein: MVTGSNIDMLPTIDNGLERHGGNLEDSFTELELILQAHRNQQFVGRERDLNIYRSGSAPPTVEGSLSAVGSLFANPDFGDINGITAVAGSSSSSNNGMLSEDEIRSHPAYLSYYYSHENINPRLPPPLLSKEDWRVAQRFQASGSSLGNIGDWRKKKLVDGGDSSSLFSMQPGLSVQQEQNDLMELRNTNARNTSRKMSAEWLDRGSDGLVGLSGTGLGARRKSFADILQDGLDRPATLSGHLSQPSSRNAFSDMLDAASIADPSPPGFHNAAESIESLPAGVARPGVVGVQSHGKTTSHSFASAVGSSLSRSTTPEPYLVGRSSGSGLPPVGSKVGHAEKKNIIGSNVQNGHSSAVTELSEIGATLSGLTLSKTRHADENSHMRSQLQVDLDNQLDFSFNMPNGHNQSLQQQFIDKSSAEKLAFPTNHIDLARKKGIAPNINAYNISSNGQVSIPKRTSSSADLYAKVHPSGLGSLEVCDVGHPNVNLANTDFIGQLPSAYSVNQKLNSAIKNHLNAGSPLTGTGDRQSLNRAGNQGADLLSPLMDPRYIQYLQRTSQYGARAAASPDSLLSGNYVGTLHGDLDGLQKAYLEAILAQQKQQYELPLLGKAAALNHGYYGNPSYGLGMPFAGNSMANSVLPSIGSGSIQNDRTARFNSMMRTSTGAWPSDIGNNVDGRFISSLLDEFKNNKTRCFELLDIIDHVVEFSTDQYGSRFIQQKLETATEEEKTKIFPEIIPHARALMTDVFGNYVIQKFFEHGTESQRAELASQLTGHVLPLSLQMYGCRVIQKALEVVGVDQQTGMVAELDGSIMKCVRDQNGNHVIQKCIECVPQDRILFIISAFHGQVVALSTHPYGCRVIQRVLEHCDDVKTQQIIMDEIMLSVCTLAQDQYGNYVIQHVLEHGKPHERSAIISKLAGQIVKMSQQKFASNVVEKCLTFGGPEERQILVNEMLGSTDENEPLQAMMKDQFGNYVVQKVLETCDDRSLELILSRIKVHLNALKRYTYGKHIVSRVEKLIATGERRIGLLSSLAA |
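The relationship between the CDS and Protein entries can be illustrated with a small translation routine. This is a sketch only: it assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly, and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code (assumed), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# First six codons and last five codons of the CDS entry above.
print(translate("ATGGTTACAGGCAGTAAC"))  # MVTGSN
print(translate("TCATTGGCCGCCTAA"))     # SLAA
```

Both fragments reproduce the corresponding ends of the Protein entry (MVTGSN... and ...SLAA). Translating the full CDS requires joining the sequence into a single string without line breaks first.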